28 research outputs found

    Universal power law behaviors in genomic sequences and evolutionary models

    Full text link
    We study the length distribution of a particular class of DNA sequences known as 5'UTR exons. These exons belong to the messanger RNA of protein coding genes, but they are not coding (they are located upstream of the coding portion of the mRNA) and are thus less constrained from an evolutionary point of view. We show that both in mouse and in human these exons show a very clean power law decay in their length distribution and suggest a simple evolutionary model which may explain this finding. We conjecture that this power law behaviour could indeed be a general feature of higher eukaryotes.Comment: 15 pages, 3 figure

    MiREx: mRNA levels prediction from gene sequence and miRNA target knowledge

    No full text
    Abstract Messenger RNA (mRNA) has an essential role in the protein production process. Predicting mRNA expression levels accurately is crucial for understanding gene regulation, and various models (statistical and neural network-based) have been developed for this purpose. A few models predict mRNA expression levels from the DNA sequence, exploiting the DNA sequence and gene features (e.g., number of exons/introns, gene length). Other models include information about long-range interaction molecules (i.e., enhancers/silencers) and transcriptional regulators as predictive features, such as transcription factors (TFs) and small RNAs (e.g., microRNAs - miRNAs). Recently, a convolutional neural network (CNN) model, called Xpresso, has been proposed for mRNA expression level prediction leveraging the promoter sequence and mRNAs’ half-life features (gene features). To push forward the mRNA level prediction, we present miREx, a CNN-based tool that includes information about miRNA targets and expression levels in the model. Indeed, each miRNA can target specific genes, and the model exploits this information to guide the learning process. In detail, not all miRNAs are included, only a selected subset with the highest impact on the model. MiREx has been evaluated on four cancer primary sites from the genomics data commons (GDC) database: lung, kidney, breast, and corpus uteri. Results show that mRNA level prediction benefits from selected miRNA targets and expression information. Future model developments could include other transcriptional regulators or be trained with proteomics data to infer protein levels

    Mathematical Modelling of Molecular Pathways Enabling Tumour Cell Invasion and Migration

    No full text
    International audienceUnderstanding the etiology of metastasis is very important in clinical perspective, since it is estimated that metastasis accounts for 90% of cancer patient mortality. Metastasis results from a sequence of multiple steps including invasion and migration. The early stages of metastasis are tightly controlled in normal cells and can be drastically affected by malignant mutations; therefore, they might constitute the principal determinants of the overall metastatic rate even if the later stages take long to occur. To elucidate the role of individual mutations or their combinations affecting the metastatic development, a logical model has been constructed that recapitulates published experimental results of known gene perturbations on local invasion and migration processes, and predict the effect of not yet experimentally assessed mutations. The model has been validated using experimental data on transcriptome dynamics following TGF-ÎČ-dependent induction of Epithelial to Mesenchymal Transition in lung cancer cell lines. A method to associate gene expression profiles with different stable state solutions of the logical model has been developed for that purpose. In addition, we have systematically predicted alleviating (masking) and synergistic pairwise genetic interactions between the genes composing the model with respect to the probability of acquiring the metastatic phenotype. We focused on several unexpected synergistic genetic interactions leading to theoretically very high metastasis probability. Among them, the synergistic combination of Notch overexpression and p53 deletion shows one of the strongest effects, which is in agreement with a recent published experiment in a mouse model of gut cancer. The mathematical model can recapitulate experimental mutations in both cell line and mouse models. Furthermore, the model predicts new gene perturbations that affect the early steps of metastasis underlying potential intervention points for innovative therapeutic strategies in oncology

    Conceptual and computational framework for logical modelling of biological networks deregulated in diseases

    No full text
    International audienceMathematical models can serve as a tool to formalize biological knowledge from diverse sources, to investigate biological questions in a formal way, to test experimental hypotheses, to predict the effect of perturbations and to identify underlying mechanisms. We present a pipeline of computational tools that performs a series of analyses to explore a logical model’s properties. A logical model of initiation of the metastatic process in cancer is used as a transversal example. We start by analysing the structure of the interaction network constructed from the literature or existing databases. Next, we show how to translate this network into a mathematical object, specifically a logical model, and how robustness analyses can be applied to it. We explore the visualization of the stable states, defined as specific attractors of the model, and match them to cellular fates or biological read-outs. With the different tools we present here, we explain how to assign to each solution of the model a probability and how to identify genetic interactions using mutant phenotype probabilities. Finally, we connect the model to relevant experimental data: we present how some data analyses can direct the construction of the network, and how the solutions of a mathematical model can also be compared with experimental data, with a particular focus on high-throughput data in cancer biology. A step-by-step tutorial is provided as a Supplementary Material and all models, tools and scripts are provided on an accompanying website: https://github.com/sysbio-curie/Logical_modelling_pipeline

    Classification of gene signatures for their information value and functional redundancy

    No full text
    Data-driven signature classification An informative collection of gene signatures for transcriptomic data analysis is constructed. The number of transcriptomic signatures grows fast and their collections are highly redundant that hampers omics data analyses interpretation. A computational biology team from Institut Curie led by Andrei Zinovyev selected a collection of 962 gene signatures shown to be informative for cancer studies and reflecting mechanisms of cancer progression. The signatures were filtered from a large compendium without requiring any manual curation by experts through a large-scale unbiased analysis of pancancer data. They have much higher chance to obtain significant enrichment scores in a comparative trancriptomic study. The authors integrated the 962 signatures into InfoSigMap, a new data visualization resource for the interpretation of the results of omics data analyses, which facilitates getting an insight into the mechanisms driving cancer

    Antagonism pattern detection between microRNA and target expression in Ewing's sarcoma.

    Get PDF
    MicroRNAs (miRNAs) have emerged as fundamental regulators that silence gene expression at the post-transcriptional and translational levels. The identification of their targets is a major challenge to elucidate the regulated biological processes. The overall effect of miRNA is reflected on target mRNA expression, suggesting the design of new investigative methods based on high-throughput experimental data such as miRNA and transcriptome profiles. We propose a novel statistical measure of non-linear dependence between miRNA and mRNA expression, in order to infer miRNA-target interactions. This approach, which we name antagonism pattern detection, is based on the statistical recognition of a triangular-shaped pattern in miRNA-target expression profiles. This pattern is observed in miRNA-target expression measurements since their simultaneously elevated expression is statistically under-represented in the case of miRNA silencing effect. The proposed method enables miRNA target prediction to strongly rely on cellular context and physiological conditions reflected by expression data. The procedure has been assessed on synthetic datasets and tested on a set of real positive controls. Then it has been applied to analyze expression data from Ewing's sarcoma patients. The antagonism relationship is evaluated as a good indicator of real miRNA-target biological interaction. The predicted targets are consistently enriched for miRNA binding site motifs in their 3'UTR. Moreover, we reveal sets of predicted targets for each miRNA sharing important biological function. The procedure allows us to infer crucial miRNA regulators and their potential targets in Ewing's sarcoma disease. It can be considered as a valid statistical approach to discover new insights in the miRNA regulatory mechanisms
    corecore